Book Review: Machine-Aided Linguistic Discovery: An Introduction and Some Examples by Vladimir Pericliev

Author

Eric Smith

Abstract

The subtitle of Vladimir Pericliev's book, An Introduction and Some Examples, is a succinct and accurate description of its contents. Pericliev argues briefly for the usefulness of computer-aided techniques in linguistic discovery, contrasting them with the intuitionist approach which has characterized linguistic discovery throughout much of its history. The bulk of the book is devoted to examples of software-aided linguistic discovery drawn from his own work.

Chapter 1 starts by sketching out the current state of discovery techniques in linguistic theory, categorizing scientific discovery into three main approaches: the intuitionist approach, the chance approach, and the problem-solving approach. Discoveries by intuition and by chance remain the purview of humans, but clearly the problem-solving approach can benefit from the application of computational techniques.

Chapter 2 presents the KINSHIP program, which performs "parsimonious discrimination" in order to determine the minimal set of features necessary to discriminate all of a language's kinship terms. The program is used to discover feature geometries, superior to existing human-discovered ones, which describe the kinship terminology of languages like English and Bulgarian.

Chapter 3 extends the ideas used in KINSHIP to a program called MPD (maximal parsimonious discrimination), which is then applied to a variety of other tasks, some of which are unconnected to linguistics. Of these applications, the most interesting is the use of MPD to determine the segment profiles which uniquely identify languages in the UPSID-451 database (consisting of segment inventories from 451 languages, selected to provide broad coverage of the world's language families) (Maddieson and Precoda 1991). Although Pericliev discusses his results at considerable length, it is not clear what the theoretical usefulness of these profiles might be. What does it really tell us about French to know that it is the only language in the database to contain the phoneme [œ̃]? Of more practical interest is Pericliev's discussion of the process of converting the UPSID data into a featural representation to make it amenable to processing, describing how to represent underspecified segments and how to deal with transcription variations. This sort of necessary preprocessing constitutes an important and underemphasized part of the process of machine-aided linguistic discovery. The study of UPSID does produce some interesting, though not unexpected, results. For instance, when a profile contains more than one unique segment, the majority of these segments share a common feature, and 85.8% of the unique segments have some sort …
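
The idea of parsimonious discrimination described for KINSHIP can be illustrated with a small sketch. This is not Pericliev's algorithm: it is a brute-force search over feature subsets, and the feature table (`TERMS`), the feature names, and the function names are hypothetical, invented only for this illustration. It looks for the smallest sets of features under which no two kinship terms receive the same description.

```python
from itertools import combinations

# Toy, hypothetical feature table (not data from the book).
# "is_kin" is deliberately redundant: it has the same value for every term.
TERMS = {
    "father":   {"sex": "male",   "generation": 1,  "lineal": True,  "is_kin": True},
    "mother":   {"sex": "female", "generation": 1,  "lineal": True,  "is_kin": True},
    "son":      {"sex": "male",   "generation": -1, "lineal": True,  "is_kin": True},
    "daughter": {"sex": "female", "generation": -1, "lineal": True,  "is_kin": True},
    "uncle":    {"sex": "male",   "generation": 1,  "lineal": False, "is_kin": True},
    "aunt":     {"sex": "female", "generation": 1,  "lineal": False, "is_kin": True},
}


def discriminates(terms, features):
    """Return True if no two terms share the same values on `features`."""
    seen = set()
    for values in terms.values():
        key = tuple(values[f] for f in features)
        if key in seen:
            return False
        seen.add(key)
    return True


def minimal_feature_sets(terms):
    """Return every smallest feature subset that tells all terms apart.

    Exhaustive search, smallest subsets first; only practical for
    small feature inventories.
    """
    all_features = sorted({f for values in terms.values() for f in values})
    for size in range(1, len(all_features) + 1):
        found = [
            subset
            for subset in combinations(all_features, size)
            if discriminates(terms, subset)
        ]
        if found:
            return found
    return []


if __name__ == "__main__":
    for subset in minimal_feature_sets(TERMS):
        print(subset)
```

On this toy table the search drops the redundant `is_kin` feature and keeps `generation`, `lineal`, and `sex`, since no pair of those three separates all six terms. The same search, applied to segment inventories instead of kinship terms, gives a rough picture of what a uniquely identifying profile is, though the book's MPD program should not be assumed to work this way.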

Similar resources

An Appraisal of UNIVAUTO: The First Discovery Program to Generate a Scientific Article

In a companion paper ([14]), I describe UNIVAUTO (UNIVersals AUthoring TOol), a linguistic discovery program that uncovers language universals and can write a report in English on its discoveries. In this contribution, the system is evaluated along a number of parameters that have been suggested in the literature as necessary ingredients of a successful discovery program. These parameters inclu...

A Linguistic Discovery Program that Verbalizes its Discoveries

We describe a discovery program, called UNIVAUTO (UNIVersals AUthoring TOol), whose domain of application is the study of language universals, a classic trend in contemporary linguistics. Accepting as input information about languages, presented in terms of feature-values, the discoveries of another human agent arising from the same data, as well as some additional data, the program discovers th...

Empirical Discovery in Linguistics

A discovery system for detecting correspondences in data is described, based on the familiar induction methods of J. S. Mill. Given a set of observations, the system induces the “causally” related facts in these observations. Its application to empirical linguistic discovery is described. The paper is organized as follows. I begin the discussion by revealing two developments, the transformation...

Drug Discovery Acceleration Using Digital Microfluidic Biochip Architecture and Computer-aided-design Flow

A Digital Microfluidic Biochip (DMFB) offers a promising platform for medical diagnostics, DNA sequencing, Polymerase Chain Reaction (PCR), and drug discovery and development. Conventional drug discovery procedures require time-consuming and costly manned experiments with a high degree of human error and no guarantee of success. On the other hand, DMFB can be a great solution for miniaturization, int...

Computer Enumeration of Significant Implicational Universals of Kinship Terminology

The discovery of general patterns and their subsequent explanation is a familiar method in linguistics and other cross-cultural research. This paper addresses the computerized enumeration of significant cultural and linguistic patterns, specifically implicational universals. We dispute published suggestions that the mechanical generation of universals is inadvisable, by arguing that such claims...

Publication date: 2010
